AITopics | support function

Collaborating Authors

support function

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

On model selection consistency of penalized M-estimators: a geometric theory

Jason D. Lee, Yuekai Sun, Jonathan E. Taylor

Neural Information Processing SystemsFeb-18-2026, 21:25:37 GMT

Neural Information Processing Systems http://nips.cc/

consistency, model selection consistency, penalty, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
Asia > Middle East > Jordan (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Better Neural Network Expressivity: Subdividing the Simplex

Bakaev, Egor, Brunck, Florestan, Hertrich, Christoph, Stade, Jack, Yehudayoff, Amir

arXiv.org Artificial IntelligenceNov-10-2025

This work studies the expressivity of ReLU neural networks with a focus on their depth. A sequence of previous works showed that $\lceil \log_2(n+1) \rceil$ hidden layers are sufficient to compute all continuous piecewise linear (CPWL) functions on $\mathbb{R}^n$. Hertrich, Basu, Di Summa, and Skutella (NeurIPS'21 / SIDMA'23) conjectured that this result is optimal in the sense that there are CPWL functions on $\mathbb{R}^n$, like the maximum function, that require this depth. We disprove the conjecture and show that $\lceil\log_3(n-1)\rceil+1$ hidden layers are sufficient to compute all CPWL functions on $\mathbb{R}^n$. A key step in the proof is that ReLU neural networks with two hidden layers can exactly represent the maximum function of five inputs. More generally, we show that $\lceil\log_3(n-2)\rceil+1$ hidden layers are sufficient to compute the maximum of $n\geq 4$ numbers. Our constructions almost match the $\lceil\log_3(n)\rceil$ lower bound of Averkov, Hojny, and Merkert (ICLR'25) in the special case of ReLU networks with weights that are decimal fractions. The constructions have a geometric interpretation via polyhedral subdivisions of the simplex into ``easier'' polytopes.

artificial intelligence, machine learning, neural network, (17 more...)

arXiv.org Artificial Intelligence

2505.14338

Country:

North America > United States (0.46)
Europe > Germany (0.28)

Genre: Research Report (0.64)

Industry: Energy (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

On Condorcet's Jury Theorem with Abstention

Meir, Reshef, Ghalme, Ganesh

arXiv.org Artificial IntelligenceOct-22-2025

The well-known Condorcet Jury Theorem states that, under majority rule, the better of two alternatives is chosen with probability approaching one as the population grows. We study an asymmetric setting where voters face varying participation costs and share a possibly heuristic belief about their pivotality (ability to influence the outcome). In a costly voting setup where voters abstain if their participation cost is greater than their pivotality estimate, we identify a single property of the heuristic belief -- weakly vanishing pivotality -- that gives rise to multiple stable equilibria in which elections are nearly tied. In contrast, strongly vanishing pivotality (as in the standard Calculus of Voting model) yields a unique, trivial equilibrium where only zero-cost voters participate as the population grows. We then characterize when nontrivial equilibria satisfy a version of the Jury Theorem: below a sharp threshold, the majority-preferred candidate wins with probability approaching one; above it, both candidates either win with equal probability.

artificial intelligence, equilibrium, probability, (16 more...)

arXiv.org Artificial Intelligence

2510.18062

Genre: Research Report (0.50)

Industry: Government > Voting & Elections (1.00)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.68)

Add feedback

On model selection consistency of penalized M-estimators: a geometric theory

Jason D. Lee, Yuekai Sun, Jonathan E. Taylor

Neural Information Processing SystemsOct-3-2025, 06:23:03 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, penalty, (16 more...)

Neural Information Processing Systems

Country: North America > United States > California (0.28)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Gravity Well Echo Chamber Modeling With An LLM-Based Confirmation Bias Model

Jackson, Joseph, Lapin, Georgiy, Thompson, Jeremy E.

arXiv.org Artificial IntelligenceSep-9-2025

Social media echo chambers play a central role in the spread of misinformation, yet existing models often overlook the influence of individual confirmation bias. An existing model of echo chambers is the "gravity well" model, which creates an analog between echo chambers and spatial gravity wells. We extend this established model by introducing a dynamic confirmation bias variable that adjusts the strength of pull based on a user's susceptibility to belief-reinforcing content. This variable is calculated for each user through comparisons between their posting history and their responses to posts of a wide range of viewpoints. Incorporating this factor produces a confirmation-bias-integrated gravity well model that more accurately identifies echo chambers and reveals community-level markers of information health. We validated the approach on nineteen Reddit communities, demonstrating improved detection of echo chambers. Our contribution is a framework for systematically capturing the role of confirmation bias in online group dynamics, enabling more effective identification of echo chambers. By flagging these high-risk environments, the model supports efforts to curb the spread of misinformation at its most common points of amplification.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2509.03832

Country: North America > United States (0.28)

Genre: Research Report > Promising Solution (0.47)

Industry: Media > News (0.95)

Technology:

Information Technology > Communications > Social Media (0.92)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.65)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.31)

Add feedback

A Further Preliminaries

Neural Information Processing SystemsAug-19-2025, 03:47:07 GMT

The main object of interest in the present work is the convex floating body . Let us first recall the definition of the convex floating body. The floating body has the following desirable properties. The floating body is a natural high dimensional statistical construction. Approximate Differential Privacy Throughout the paper we referred to a similar notion to "pure" In this Section we establish our main meta-theorem, Theorem 10.

algorithm, artificial intelligence, machine learning, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Learned enclosure method for experimental EIT data

Sippola, Sara, Rautio, Siiri, Hauptmann, Andreas, Ide, Takanori, Siltanen, Samuli

arXiv.org Artificial IntelligenceJul-8-2025

Electrical impedance tomography (EIT) is a non-invasive imaging method with diverse applications, including medical imaging and non-destructive testing. The inverse problem of reconstructing internal electrical conductivity from boundary measurements is nonlinear and highly ill-posed, making it difficult to solve accurately. In recent years, there has been growing interest in combining analytical methods with machine learning to solve inverse problems. In this paper, we propose a method for estimating the convex hull of inclusions from boundary measurements by combining the enclosure method proposed by Ikehata with neural networks. We demonstrate its performance using experimental data. Compared to the classical enclosure method with least squares fitting, the learned convex hull achieves superior performance on both simulated and experimental data.

artificial intelligence, hull, machine learning, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.3934/ammc.2025008

2504.11512

Country: Europe > Finland (0.28)

Genre: Research Report (0.64)

Industry:

Health & Medicine > Therapeutic Area (0.68)
Health & Medicine > Diagnostic Medicine > Imaging (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

On the Expressiveness of Rational ReLU Neural Networks With Bounded Depth

Averkov, Gennadiy, Hojny, Christopher, Merkert, Maximilian

arXiv.org Artificial IntelligenceFeb-15-2025

Forp = 3, this covers the cases of binary fractions as well as decimal fractions, two of the most common practical settings. Moreover, it shows that the expressive power of ReLU networks grows for every N up to O(logn). In the case of rational weights that are N-ary fractions for any fixed N, even allowing arbitrarily large denominators and arbitrary width does not facilitate exact representations of low constant depth. Theorem 4 can be viewed as a partial confirmation of Conjecture 1 for rational weights, as the term lnlnN is growing extremely slowly in N. If one could replace lnlnN by a constant, the conjecture would be confirmed for rational weights, up to a constant multiple. As already highlighted in Haase et al. (2023), confirmation of the conjecture would theoretically explain the significance of max-pooling in the context of ReLU networks: It seems that the expressive power of ReLU is not enough to model the maximum of a large number of input variables unless network architectures of high-enough depth are used. So, enhancing ReLU networks with max-pooling layers could be a way to reach higher expressive power with shallow networks.

artificial intelligence, machine learning, relu network, (18 more...)

arXiv.org Artificial Intelligence

2502.06283

Country:

North America > United States (0.46)
Europe (0.46)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Condorcet's Jury Theorem with Abstention

Ghalme, Ganesh, Meir, Reshef

arXiv.org Artificial IntelligenceAug-1-2024

The well-known Condorcet's Jury theorem posits that the majority rule selects the best alternative among two available options with probability one, as the population size increases to infinity. We study this result under an asymmetric two-candidate setup, where supporters of both candidates may have different participation costs. When the decision to abstain is fully rational i.e., when the vote pivotality is the probability of a tie, the only equilibrium outcome is a trivial equilibrium where all voters except those with zero voting cost, abstain. We propose and analyze a more practical, boundedly rational model where voters overestimate their pivotality, and show that under this model, non-trivial equilibria emerge where the winning probability of both candidates is bounded away from one. We show that when the pivotality estimate strongly depends on the margin of victory, victory is not assured to any candidate in any non-trivial equilibrium, regardless of population size and in contrast to Condorcet's assertion. Whereas, under a weak dependence on margin, Condorcet's Jury theorem is restored.

equilibrium, probability, voter, (16 more...)

arXiv.org Artificial Intelligence

2408.00317

Country:

North America > United States > New York > New York County > New York City (0.04)
Asia > Middle East > Israel > Haifa District > Haifa (0.04)
Asia > India > Telangana > Hyderabad (0.04)

Genre: Research Report (1.00)

Industry:

Government > Voting & Elections (1.00)
Leisure & Entertainment (0.88)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)

Add feedback

Inference for an Algorithmic Fairness-Accuracy Frontier

Liu, Yiqi, Molinari, Francesca

arXiv.org Artificial IntelligenceFeb-13-2024

Algorithms are increasingly used in many aspects of life, often to guide or support high stake decisions. For example, algorithms are used to predict criminal re-offense risk, and this prediction feeds into the determination of which defendants should receive bail; to predict a job market candidate's productivity, and this prediction feeds into hiring decisions; to predict an applicant's likelihood of default on a loan, and this prediction feeds into the decision of who should receive the loan; to predict a student's performance in college, and this prediction feeds into the decision of which students should be admitted to college; and to assign a health risk score to a patient, and this score feeds into the decision of which patients to treat. Yet, a growing body of literature documents that algorithms may exhibit bias against legally protected subgroups of the population, both in terms of their ability to make correct predictions, and in the type of decisions that they lead to (see, e.g., Angwin et al., 2016, Arnold et al., 2021, Obermeyer et al., 2019, Berk et al., 2021). The bias may arise, for example, because of the choice of labels in the data that the algorithm is trained on, the objective function that the algorithm optimizes, the training procedure, and various other factors involved in the construction of the algorithm. To understand what drives algorithmic bias, several models have been put forth that decompose the source of disparity (e.g., Rambachan et al., 2020a) or account for taste-based discrimination and unobservables in the generation of training labels (e.g., Rambachan and Roth, 2020). Regardless of whether the screening decision is based on a prediction made by a human or by an algorithm, the law recognizes two main categories of discrimination: disparate treatment, which amounts to deliberately treating an individual differently based on their membership to a protected class; and disparate impact, which amounts to adversely affecting a protected class disproportionately, no matter the intent (see, e.g., Kleinberg et al., 2018b, Blattner and Spiess, 2022, for a review of the discrimination law in the U.S). Often, as part of an effort to avoid disparate treatment, algorithms are designed so that they do not take race, gender, or other sensitive attributes as an input. Even class-blind algorithms, however, may yield disparate outcome. Crucially, there are trade-offs in the design of an algorithm between making it more fair, in the sense that it has lower disparate impact, and making it more accurate, in the sense 3 that, e.g., it has a higher probability to assign treatment to the individuals that benefit from it and to not assign it to the other individuals. Indeed, improving fairness often comes at the cost of accuracy.

algorithm, estimator, support function, (15 more...)

arXiv.org Artificial Intelligence

2402.08879

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
North America > United States > New York (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)

Genre: Research Report (0.50)

Industry: Law (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.93)

Add feedback